SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: CLOUGH et al - - - 

(B) STREET: National Institute for Medical Research, Mill 

Hill 

(C) CITY: London 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE (ZIP): NW7 1AA 

(ii) TITLE OF INVENTION: AN EF-TU PROTEIN ENCODED ON THE PLASTID DNA 
OF THE MALARIA PARASITE AND PROTEIN SYNTHESIS INHIBITORS 
EFFECTIVE AS ANTI- MALARIAL COMPOUNDS 

(iii) NUMBER OF SEQUENCES: 2 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US NA 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1230 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 
10 (A) NAME /KEY: CDS 

(B) LOCATION:!.. 1230 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

15 

ATG AAT AAT AAA TTA TTT TTA AGA AAT AAA CAA CAT ATA AAT TTA GGT 48 
Met Asn Asn Lys Leu Phe Leu Arg Asn Lys Gin His He Asn Leu Gly 
15 10 15 



20 ACT ATA GGG CAT GTA GAT CAT GGA AAA ACT ACA TTA ACA ACA GCT ATA 96 
Thr He Gly His Val Asp His Gly Lys Thr Thr Leu Thr Thr Ala He 
20 25 30 

TCT TAT TTA TTA AAT TTA CAA GGA TTA TCA AAA AAA TAT AAT TAT TCA 144 
25 Ser Tyr Leu Leu Asn Leu Gin Gly Leu Ser Lys Lys Tyr Asn Tyr Ser 
35 40 45 

GAT ATT GAT TCA GCT CCA GAA GAA AAA ATA AGA GGT ATT ACA ATA AAT 192 
Asp He Asp Ser Ala Pro Glu Glu Lys He Arg Gly He Thr He Asn 
30 50 55 • 60 

ACA ACA CAT ATT GAA TAT GAA ACT TTA ACA AAA CAT TGT GCT CAT ATA 240 
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Thr Thr His He Glu Tyr Glu Thr Leu Thr Lys His Cys Ala His He 
65 70 75 80 



GAT TGT CCA GGA CAT TCC GAT TAT ATT AAA AAT ATG ATT ATA GGA GCC 
Asp Cys Pro Gly His Ser Asp Tyr He Lys Asn Met He lie Gly Ala 
85 90 95 



288 



10 



ACA CAA ATG GAT ATA GCA ATT TTA GTA ATA TCT ATA ATA GAT GGT ATA 
Thr Gin Met Asp He Ala He Leu Val He Ser He He Asp Gly He 
100 105 110 



336 



15 



ATG CCT CAA ACT TAT GAA CAT TTA TTA TTA ATA AAA CAA ATA GGT ATA 
Met Pro Gin Thr Tyr Glu His Leu Leu Leu He Lys Gin He Gly He 
115 120 125 

AAA AAT ATA ATT ATT TTT TTA AAT AAA GAA GAT TTA TGT GAT GAT GTT 
Lys Asn He He He Phe Leu Asn Lys Glu Asp Leu Cys Asp Asp Val 
130 135 140 



384 



432 



20 GAA TTA ATA GAT TTT ATA AAA TTA GAA GTA AAT GAA TTA TTA ATT AAA 
Glu Leu He Asp Phe He Lys Leu Glu Val Asn Glu Leu Leu He Lys 
145 150 155 160 



480 



TAT AAT TTT GAT TTA AAT TAT ATA CAT ATA TTA ACT GGT TCA GCA TTA 
25 Tyr Asn Phe Asp Leu Asn Tyr He His He Leu Thr Gly Ser Ala Leu 

165 170 175 



528 



30 



AAT GTA ATA AAT ATA ATT CAA AAA AAT AAG GAT TAT GAA TTA ATA AAA 
Asn Val He Asn He He Gin Lys Asn Lys Asp Tyr Glu Leu He Lys 
180 185 190 



576 



TCT AAT ATT TGG ATA CAA AAA TTA AAT AAT TTA ATT CAA ATA ATT GAT 



-35- 



624 



Ser Asn He Trp He Gin Lys Leu Asn Asn Leu He Gin He He Asp 
195 200 205 



AAT ATT ATA ATA CCT ACT AGA AAA ATT AAT GAT TAC TTT TTA ATG TCA 
Asn He He He Pro Thr Arg Lys He Asn Asp Tyr Phe Leu Met Ser 
210 215 220 



672 



10 



ATA GAA GAT GTA TTT TCT ATA ACA GGT AGA GGT ACA GTA GTA ACA GGT 
lie Glu Asp Val Phe Ser He Thr Gly Arg Gly Thr Val Val Thr Gly 
225 230 235 240 



720 



15 



AAG ATT GAA CAA GGA TGT ATA AAT TTA AAT GAT GAA ATT GAA ATT TTA 
Lys He Glu Gin Gly Cys He Asn Leu Asn Asp Glu He Glu lie Leu 
245 250 255 

AAA TTT GAA AAA TCA TCT CCT AAT TTA ACA ACA GTT ATA GGA TTA GAA 
Lys Phe Glu Lys Ser Ser Pro Asn Leu Thr Thr Val He Gly Leu Glu 
260 265 270 



768 



816 



20 ATG TTT AAA AAA CAA TTA ACA CAA GCA CAA TCC GGA GAT AAT GTA GGT 
Met Phe Lys Lys Gin Leu Thr Gin Ala Gin Ser Gly Asp Asn Val Gly 
275 280 285 



864 



ATT TTA TTA AGA AAT ATT CAA AAA AAA GAT ATA AAA AGA GGT ATG ATT 
25 He Leu Leu Arg Asn He Gin Lys Lys Asp He Lys Arg Gly Met lie 
290 295 300 



912 



TTA GCA ACA CCT AAT AAA TTA AAA GTA TAT AAG TCT TTT ATA GCT GAA 
Leu Ala Thr Pro Asn Lys Leu Lys Val Tyr Lys Ser Phe He Ala Glu 
30 305 310 315 320 



960 



ACA TAT ATT TTA ACT AAA GAA GAA GGT GGT CGT CAT AAA CCT TTT AAT 
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1008 



• # 

Thr Tyr He Leu Thr Lys Glu Glu Gly Gly Arg His Lys Pro Phe Asn 
325 330 335 

ATT GGA TAT AAA CCT CAA TTT TTT ATT CGT ACA GTA GAT GTT ACT GGA 1056 
5 lie Gly Tyr Lys Pro Gin Phe Phe He Arg Thr Val Asp Val Thr Gly 
340 345 350 



GAA ATT AAA AAT ATA TAT TTA AAT GAA AAT GTA CAA AAA GTA GC-T ATA 1104 
Glu He Lys Asn He Tyr Leu Asn Glu Asn Val Gin Lys Val Ala He 
10 355 360 365 

CCT GGA GAT AAA ATA ACA TTA CAT ATT GAA TTA AAA CAT TAT ATA GTG 1152 

Pro Gly Asp Lys He Thr Leu His He Glu Leu Lys His Tyr He Val 
370 375 380 

15 

TTG ACA TTA AAT ATG AAA TTT TCT ATT AGA GAA GGA GGA AAA ACA ATA 1200 

Leu Thr Leu Asn Met Lys Phe Ser He Arg Glu Gly Gly Lys Thr He 
385 390 395 400 



20 GGA GCA GGT ATT ATA ACA GAA ATA AAA AAT 
Gly Ala Gly He He Thr Glu He Lys Asn 
405 410 



1230 



25 (2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 410 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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1 • 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Asn Asn Lys Leu Phe Leu Arg Asn Lys Gin His lie Asn Leu Gly 
15 10 15 

5 

Thr He Gly His Val Asp His Gly Lys Thr Thr Leu Thr Thr Ala He 
20 25 30 

Ser Tyr Leu Leu Asn Leu Gin Gly Leu Ser Lys Lys Tyr Asn Tyr Ser 
10 35 40 45 

Asp He Asp Ser Ala Pro Glu Glu Lys He Arg Gly He Thr lie Asn 
50 55 60 

15 Thr Thr His lie Glu Tyr Glu Thr Leu Thr Lys His Cys Ala His He 
65 70 75 80 

Asp Cys Pro Gly His Ser Asp Tyr lie Lys Asn Met He He Gly Ala 
85 90 95 

20 

Thr Gin Met Asp He Ala He Leu Val He Ser lie He Asp Gly He 
100 105 110 

Met Pro Gin Thr Tyr Glu His Leu Leu Leu He Lys Gin He Gly He 
25 115 120 125 

Lys Asn He He He Phe Leu Asn Lys Glu Asp Leu Cys. Asp Asp Val 
130 135 140 

30 Glu Leu He Asp Phe He Lys Leu Glu Val Asn Glu Leu Leu He Lys 
145 150 155 160 
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• ' Q # 

Tyr Asn Phe Asp Leu Asn Tyr lie His He Leu Thr Gly Ser Ala Leu 
165 170 175 

Asn Val lie Asn lie lie Gin Lys Asn Lys Asp Tyr Glu Leu He Lys 
5 180 185 190 

Ser Asn He Trp He Gin Lys Leu Asn Asn Leu He Gin He He Asp 
195 200 205 

10 Asn He lie He Pro Thr Arg Lys lie Asn Asp Tyr Phe Leu Met Ser 
210 215 220 

He Glu Asp Val Phe Ser He Thr Gly Arg Gly Thr Val Val Thr Gly 
225 230 235 240 

15 

Lys lie Glu Gin Gly Cys He Asn Leu Asn Asp Glu He Glu He Leu 
245 250 255 

Lys Phe Glu Lys Ser Ser Pro Asn Leu Thr Thr Val He Gly Leu Glu 
20 260 265 270 

Met Phe Lys Lys Gin Leu Thr Gin Ala Gin Ser Gly Asp Asn Val Gly 
275 280 285 

25 He Leu Leu Arg Asn He Gin Lys Lys Asp He Lys Arg Gly Met He 
290 295 300 

Leu Ala Thr Pro Asn Lys Leu Lys Val Tyr Lys Ser Phe He Ala Glu 
305 310 315 320 

30 

Thr Tyr He Leu Thr Lys Glu Glu Gly Gly Arg His Lys Pro Phe Asn 
325 330 335 
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He Gly Tyr Lys Pro Gin Phe Phe He Arg Thr Val Asp Val Thr Gly 
340 345 350 

Glu lie Lys Asn lie Tyr Leu Asn Glu Asn Val Gin Lys Val Ala He 
355 360 365 

Pro Gly Asp Lys He Thr Leu His He Glu Leu Lys His Tyr He Val 
370 375 380 

Leu Thr Leu Asn Met Lys Phe Ser He Arg Glu Gly Gly Lys Thr He 
385 390 395 400 

Gly Ala Gly He He Thr Glu He Lys Asn 
405 410 
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